Dynamic Adaptive Streaming using Index-Based Learning Algorithms
نویسندگان
چکیده
We provide a unified framework using which we design scalable dynamic adaptive video streaming algorithms based on index based policies (dubbed DASIP Fig. 2) to maximize the Quality of Experience (QoE) provided to clients using video streaming services. Due to the distributed nature of our algorithm DAS-IP, it can be easily implemented in lieu of popular existing Dynamic Adaptive Streaming over HTTP (DASH) algorithm which is used by various Cloud based video streaming services, Content Delivery Networks (CDNs), Cache networks, wireless networks, vehicular networks etc. We begin by considering the simplest set-up of a onehop wireless network in which an Access Point (AP) transmits video packets to multiple clients over a shared unreliable channel. The video file meant for each client has been fragmented into several packets, and the server maintains multiple copies (each of different quality) of the same video file. Clients maintain individual packet buffers in order to mitigate the effect of uncertainty on video iterruption. Streaming experience, or the Quality of Experience (QoE) of a client depends on several factors: i) starvation/outage probability, i.e., average time duration for which the client does not play video because the buffer is empty, ii) average video quality, iii) average number of starvation periods, iv) temporal variations in video quality etc. We pose the problem of making dynamic streaming decisions in order to maximize the total QoE as a Constrained Markov Decision Process (CMDP). A consideration of the associated dual MDP suggests us that the problem is vastly simplified if the AP is allowed to charge a price per unit bandwidth usage from the clients. More concretely, a “client-by-client” QoE optimization leads to the networkwide QoE maximization, and thus provides us a decentralized streaming algorithm. This enables the clients to themselves decide the optimal streaming choices in each time-slot, and yields us a much desired client-level adaptation algorithm. The optimal policy has an appealing simple threshold structure, in which the decision to choose the video-quality and power-level of transmission depends solely on the buffer-level. In case the clients are unaware of their (possibly random and timevarying) system parmeters, we develop algorithms that learn the indices while utilizing the strucure of the optimal decentralized policy. The decentralized nature of optimal policy implies that the DAS-IP has a much “smaller” policy space to explore from, and hence converges fast.
منابع مشابه
Lot Streaming in No-wait Multi Product Flowshop Considering Sequence Dependent Setup Times and Position Based Learning Factors
This paper considers a no-wait multi product flowshop scheduling problem with sequence dependent setup times. Lot streaming divide the lots of products into portions called sublots in order to reduce the lead times and work-in-process, and increase the machine utilization rates. The objective is to minimize the makespan. To clarify the system, mathematical model of the problem is presented. Sin...
متن کاملReinforcement learning algorithms for DASH video streaming
Dynamic Adaptive Streaming over HTTP (DASH) is a video streaming standard developed in 2011; the servers have several copies of every video at different bitrates, leaving the clients complete freedom to choose the bitrate of each segment and adapt to the available bandwidth. The research on client-side strategies to optimize user Quality of Experience (QoE) is ongoing; one of the most promising...
متن کاملA Self-organizing Multi-agent System for Adaptive Continuous Unsupervised Learning in Complex Uncertain Environments
Introduction. Continuous learning and online decisionmaking in complex dynamic environments under conditions of uncertainty and limited computational recourses represent one of the most challenging problems for developing robust intelligent systems. The existing task of unsupervised clustering in statistical learning requires the maximizing (or minimizing) of a certain similarity-based objectiv...
متن کاملOnline Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features
Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...
متن کاملServer-based and Network-assisted Solutions for Adaptive Video Streaming
Koffka Khan Department of Computing and Information Technology The University of the West Indies, Trinidad and Tobago, W.I Email: [email protected] Wayne Goodridge Department of Computing and Information Technology The University of the West Indies, Trinidad and Tobago, W.I Email: [email protected] -------------------------------------------------------------------ABSTRACT----...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1612.05864 شماره
صفحات -
تاریخ انتشار 2016